Some Experiments on the Use of One-channel Noise Reduction Techniques with the Italian Speechdat Car Database
Abstract
This work investigates the use of noise reduction techniques for hands-free speech recognition in the car environment. A set of experiments was carried out using different speech enhancement algorithms based on noise estimation. In particular, linear subtraction and MMSE estimators are considered in various configurations, each of which depends on a different set of parameters. Experiments were conducted on connected and isolated digits extracted from the Italian version of the SpeechDatCar database. Spectral subtraction with a suitable choice of the oversubtraction factor yielded a relative reduction in digit error rate of more than 30%, with digit recognition accuracy rising from 94.39% to 96.15%.
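To make the role of the oversubtraction factor concrete, here is a minimal sketch of power spectral subtraction. It is not the configuration used in the paper: the function name `spectral_subtraction`, the parameters `alpha` (oversubtraction factor) and `beta` (spectral floor), the frame sizes, and the assumption that the first frames contain only noise are all illustrative choices.

```python
import numpy as np

def spectral_subtraction(noisy, noise_frames=10, frame_len=256, hop=128,
                         alpha=2.0, beta=0.01):
    """Illustrative power spectral subtraction (all parameter values are assumptions).

    alpha: oversubtraction factor applied to the noise power estimate.
    beta:  spectral floor, as a fraction of the noise power estimate.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(noisy) - frame_len) // hop

    # Slice the signal into overlapping, windowed frames.
    frames = np.stack([noisy[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    spec = np.fft.rfft(frames, axis=1)
    power = np.abs(spec) ** 2

    # Noise estimate: mean power of the leading frames, assumed speech-free.
    noise_power = power[:noise_frames].mean(axis=0)

    # Subtract the scaled noise estimate and clamp to the spectral floor.
    clean_power = np.maximum(power - alpha * noise_power, beta * noise_power)

    # Apply the resulting real-valued gain, keeping the noisy phase.
    gain = np.sqrt(clean_power / np.maximum(power, 1e-12))
    enhanced_frames = np.fft.irfft(gain * spec, n=frame_len, axis=1)

    # Overlap-add; with a Hann window at 50% overlap the window sum is close to 1.
    out = np.zeros(len(noisy))
    for i in range(n_frames):
        out[i * hop:i * hop + frame_len] += enhanced_frames[i]
    return out

# Synthetic stand-in for a noisy recording: 0.5 s of noise, then a tone in noise.
rng = np.random.default_rng(0)
t = np.arange(16000) / 8000.0
speech_like = np.where(t > 0.5, np.sin(2 * np.pi * 440 * t), 0.0)
noisy = speech_like + 0.1 * rng.standard_normal(t.size)
enhanced = spectral_subtraction(noisy, noise_frames=20)
```

Raising `alpha` above 1 removes more residual noise at the cost of more speech distortion, which is why a suitable value has to be chosen experimentally. The reported gain can also be checked by hand: the error rate falls from 5.61% to 3.85%, and (5.61 - 3.85) / 5.61 ≈ 31.4%.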
Similar resources
Feature vector selection to improve ASR robustness in noisy conditions
It is well known that noise reduction schemes are beneficial in ASR to reduce training-test mismatch due to noise. However, a significant mismatch may still remain after noise reduction, especially in the non-speech portions of the signals. To reduce the impact of this mismatch, two methods for discarding non-speech acoustic vectors at recognition time are investigated: variable frame rate pro...
The SpeechDat-Car multilingual speech databases for in-car applications: some first validation results
The main objective of SpeechDat-Car is to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. SpeechDat-Car started in April 1998 in the 4th EC framework under project code LE4-8334. The duration of the project is 30 months. Equivalent and similar resources for nine languages will be created: Danish, English, ...
SPEECHDAT-CAR. A Large Speech Database for Automotive Environments
The aims of the SpeechDat-Car project are to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. As a result, a total of ten (10) equivalent and similar resources will be created. The 10 languages are Danish, each language 600 sessions will be recorded (from at least 300 speakers) in seven characteristic envir...
SpeechDat-Car Fixed Platform
SpeechDat-Car aims to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. Two types of recordings compose the database. The first type consists of wideband audio signals recorded directly in the car, while the second type is composed of GSM signals transmitted from the car and recorded simultaneously in a far-en...
Quantile based histogram equalization for online applications
The noise robustness of automatic speech recognition systems can be increased by transforming the signal so that the cumulative density functions of the signal's values in recognition match the ones that were estimated on the training data. This paper describes a real-time online algorithm to approximate the cumulative density functions, after Mel scaled filtering, using a small number of quan...
Publication date: 2001